Rank in Wordlist | Frequency | Word |
---|---|---|
14731 | 1 | 000,00 |
14827 | 1 | 16,23 |
15620 | 1 | Blanc,i |
15695 | 1 | Caroline,ibingahambi |
17607 | 1 | Motsoeneng,besisoloko |
18812 | 1 | Sotho,Shona |
20421 | 1 | abangama-30,000 |
20426 | 1 | abangama-8,000 |
21452 | 1 | amanzi,ngamakhulu |
21939 | 1 | asemthethweni,ukuze |
Rank in Wordlist | Frequency | Word |
---|---|---|
14785 | 1 | 1)(a |
14786 | 1 | 1)(c |
14845 | 1 | 196(4)(f)(ii |
14859 | 1 | 2(2 |
14860 | 1 | 2(b |
14929 | 1 | 3(b |
14948 | 1 | 33A(4 |
14954 | 1 | 37(1 |
14976 | 1 | 42(b |
14985 | 1 | 49(4 |
Rank in Wordlist | Frequency | Word |
---|---|---|
14785 | 1 | 1)(a |
14786 | 1 | 1)(c |
14787 | 1 | 1)Lilungelo |
14845 | 1 | 196(4)(f)(ii |
14945 | 1 | 32ha)Iyachuma |
14960 | 1 | 4)Uviwo |
15078 | 1 | A)ngumgadi |
15654 | 1 | Buchner@westerncape.gov.za). |
15678 | 1 | CPUT)'s |
17658 | 1 | NECT)South |
Rank in Wordlist | Frequency | Word |
---|---|---|
4809 | 4 | i-20% |
5739 | 3 | 50% |
5817 | 3 | I-20% |
6969 | 3 | kwe-50% |
10491 | 2 | i-25% |
10492 | 2 | i-60% |
12652 | 2 | ngu-100% |
14917 | 1 | 28% |
14938 | 1 | 30% |
14947 | 1 | 33% |
Rank in Wordlist | Frequency | Word |
---|---|---|
1598 | 12 | se-N&S |
12608 | 2 | ngokwe-N&S |
17651 | 1 | N&S |
18493 | 1 | SAB&T |
28696 | 1 | iH&M |
29263 | 1 | ii-T&S |
44787 | 1 | uH&M |
Rank in Wordlist | Frequency | Word |
---|---|---|
8303 | 2 | $1000 |
14730 | 1 | $500 |
19214 | 1 | US$41 |
Rank in Wordlist | Frequency | Word |
---|---|---|
421 | 36 | I'solezwe |
15678 | 1 | CPUT)'s |
18886 | 1 | Themba's |
19770 | 1 | WCED's |
24677 | 1 | candidate's |
37502 | 1 | naz'enethole. |
40448 | 1 | ngumth'uzimele |
47025 | 1 | uthath'indawo |
48789 | 1 | yangu-'present’ |
Rank in Wordlist | Frequency | Word |
---|---|---|
1415 | 13 | kunye/okanye |
2079 | 9 | http://wced |
6587 | 3 | http://curriculum |
6588 | 3 | https://www |
6779 | 3 | ka-2006/07 |
7414 | 3 | ngo-Apreli/Meyi |
8305 | 2 | 0078/2007 |
8331 | 2 | 84/1996 |
8631 | 2 | Irivyu/ukuhlolwa |
10490 | 2 | https://wcedonline |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots